doi: 1 0.1 046/j. 1 529-881 7.2004.00096.> 


Mitochondrial DNA Heterogeneity in Tunisian Berbers 

K. Fadhlaoui-Zid’*, S. Plaza 2 *, F. Calafell 2 , M. Ben Amor 1 , D. Comas 2 ** and A. Bennamar El gaaied 1 

J Laboratoire de Genetique Moleculaire. Immunologie et Biotechnologie, Faculte des Sciences de Tunis, Universite Tunis El Manar, 
2092 Tunis, Tunisia 

2 Unitat de Biologia Evolutiva. Universitat Pompeu Fabra. Doctor Aiguader 80, 08003 Barcelona, Spain 


Summary 

Berbers live in groups scattered across North Africa whose origins and genetic relationships with their neighbours 
are not well established. The first hypervariable segment of the mitochondrial DNA (mtDNA) control region was 
sequenced in a total of 155 individuals from three Tunisian Berber groups and compared to other North Africans. 
The mtDNA lineages found belong to a common set of mtDNA haplogroups already described in North Africa. 
Besides the autochthonous North African U6 haplogroup, a group of L3 lineages characterized by the transition 
at position 16041 seems to be restricted to North Africans, suggesting that an expansion of this group of lineages 
took place around 10500 years ago in North Africa, and spread to neighbouring populations. Principal components 
and the coordinate analyses show that some Berber groups (the Tuareg, the Mozabite, and the Chenini-Douiret) 
are outliers within the North African genetic landscape. This outlier position is consistent with an isolation process 
followed by genetic drift in haplotype frequencies, and with the high heterogeneity displayed by Berbers compared 
to Arab samples as shown in the AMOVA. Despite this Berber heterogeneity, no significant differences were found 
between Berber and Arab samples, suggesting that the Arabization was mainly a cultural process rather than a 
demographic replacement. 


Introduction 

Berbers inhabit scattered places in North Africa, from 
the Moroccan western coast to the oasis Siwa in Egypt, 
and from Tunisia in the north to the oases in the mid- 
Sahara. The origin of the Berber people is not clearly 
established. According to the archaeological record, 
North Africa has been peopled since Upper Palae- 
olithic times (Newman, 1995). The first well-defined 
Palaeolithic technology, the Aterian, dates back around 
40,000 years ago, and is followed by the Iberomau- 
risian (~ 22,000 years ago; Feremback, 1985; Close & 
Wendorf, 1990). The archaeological record then re- 
veals a Mesolithic culture, the Capsian (Brett & Fen- 
tress, 1996), which gave way to the Neolithic transition 
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to agriculture that occurred around 9,500-7,000BC, 
spreading from the Near East to Egypt (Dupanloup, 
1993). Berbers may be the descendants of Mesolithic 
Capsian populations, and/ or of the later Neolithic peo- 
ple who came from the Middle East via Egypt and who 
possibly introduced the Afro-Asiatic languages to North 
Africa (Renfrew, 1991). Since then, the North African 
coast has known several invasions: Phoenicians, Ro- 
mans, Vandals, Byzantines, Arabs, Ottomans, Spanish 
and French have occupied the territory, although their 
demographic impact is not well established. 

In Tunisia, the first well-known post-Neolithic inva- 
sion was that of the Phoenicians coming from the East 
Mediterranean sea coast around 1.100BC. Nonetheless, 
their number was estimated at the end of their kingdom 
to be 100,000 compared to 500,000 Berbers living in 
Tunisia (Julien, 1961). The long dominations by the Ro- 
mans, Vandals and Byzantines had an even lesser demo- 
graphic impact. The Arab conquest in Tunisia started in 
the 7th century and was followed by a massive Bedouin 
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immigration during the 11th century. During this in- 
vasion, Berbers were restricted to Numidia (the Cen- 
tre and South of Tunisia). The Arab expansion largely 
submerged the original Berber language and customs, 
except for the tribes that were forced back to the moun- 
tains and certain villages located in Southern Tunisia. 
Between the sixteenth and twentieth centuries Tunisia 
was ruled by Turks, Spaniards and French. Other im- 
migrants came from the South (Sub-Saharan African 
slaves from Sudan). All of these populations probably 
contributed to the present Tunisian gene pool. 

Nowadays, the only criterion used to distinguish the 
Berbers from the rest of the Tunisian population is the 
language called Chelha. Berbers who speak Chelha in 
Tunisia are 1% of the global population, and are localized 
in four villages in the South of Tunisia (Sened, Matmata, 
Chenini and Douiret) and in the island of Jerba. 

Mitochondrial DNA (mtDNA) is a powerful tool 
in reconstructing population history, because a fine- 
grained phylogeography has been defined for it. This is 
particularly relevant in Northern Africa, which is at the 
crossroads of Europe, the Middle East, and Sub-Saharan 
Africa, regions for which mtDNA phylogeography is 
known in detail (Macaulay et al. 1999; Richards et al. 
2000; Salas et al. 2002). 

Several genetic studies have been performed in North 
African populations, although very few in Tunisian 
Berbers. The compilation of classical genetic markers 
in North Africa (Bosch et al. 1997) showed a clear ge- 
netic differentiation between East and West, attributed 
to human expansions from the Middle East, such as 
Palaeolithic and/or Neolithic demographic expansions. 
Analyses of autosomal STRs (Bosch et al. 2000) and Alu 
insertion polymorphisms (Comas et al. 2000) showed 
some Sub-Saharan genetic flow into NW African popu- 
lations. High-resolution analysis ofY-chromosome bial- 
lelic and STR markers (Bosch et al. 2001) has revealed 
a clear North African genetic differentiation compared 
to Europe, due to a major independent Upper Palae- 
olithic contribution in both areas, followed by gene flow 
from the Near East during the Neolithic, and small bidi- 
rectional gene flow across the Mediterranean. Finally, 
mtDNA in North Africa has been analysed mainly in 
the Western coast (Rando et al. 1998; Brakez et al. 2001; 
Plaza et al. 2003) and Egypt (Krings et al. 1999), and has 
been studied only in a non-Berber population in Tunisia 
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(Plaza et al. 2003). Previous mtDNA analyses have sug- 
gested that modern Berbers are the descendants of the 
earlier groups living in North Africa in Palaeolithic times 
(Rando et al. 1998; Macaulay et al. 1999). The anal- 
ysis of GM and KM haplotypes in Tunisian Berbers 
(Chaabani et al. 1984) suggested some heterogeneity 
within Berbers; thus mtDNA analysis will contribute 
significantly to our knowledge of the genetic pool of 
Tunisian Berbers. 

In the present study we have analyzed mtDNA HVS- 
I sequences in three Berber isolates from Southern 
Tunisia, with the aim of evaluating the possible het- 
erogeneity of these Berbers in relation to genetic drift, 
comparing them to several Arabic and Berber-speaking 
populations from North Africa, and determining the 
genetic contributions of surrounding populations to the 
Tunisian Berbers, to trace their population history. 


A total of 155 Berbers from Tunisia were analysed for 
the hypervariable segment I (HVS-I) of the mtDNA 
non-coding region. Blood samples were collected from 
four villages (Figure 1): 53 from Sened (also known as 
Sundia), 49 from Matmata (or Matmatia), and 53 from 
Chenini (Chenenaouia) and Douiret (Douiria). Chenini 
and Douiret are two neighbouring villages 20 km from 
each other; samples from these two villages were pooled 
and treated as a single population. Blood samples were 
collected according to geographic and linguistic crite- 
ria: donors were Berber Chelha speakers born in one 
of the four villages mentioned above. DNA extrac- 
tion was performed using a standard phenol-chloroform 
method. 


The HVS-I was amplified using the primers LI 5996 
and H16401 as described in Vigilant et al. (1989). 

PCR products were purified with the QiAEX II KIT 
(Qiagen). The sequencing reaction was performed 
using the Big Dye Terminator (version 3.0) Cy- 
cle Sequencing Kit, with AmpliTaq® DNA Poly- 
merase (Applied Biosystems). Sequences were run 
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Figure 1 Geographical location of the localities sampled. 

in an automatic Sequencer ABI377 (Applied Biosys- 
tems). Sequence analyses were performed from po- 
sitions 16024 to 16391 according to the Cam- 
bridge Reference Sequence (CRS; Anderson et al. 
1981; Andrews et al. 1999), and are available at 
http://www.upf.es/cexs/recerca/bioevo/index.htm 
In addition to the control region sequence, four single 
nucleotide polymorphisms (SNPs) in the mtDNA cod- 
ing region (positions 10400, 10873, 12308, and 12705) 
were determined in those individuals that were difficult 
to classify unambiguously into haplogroups based only 
in the information provided by the HVS-I. A single-base 
primer elongation method was used to genotype these 
four positions, and primers and PCR conditions are 
described elsewhere (Comas et al. 2004). Information 
yielded by each of the four SNPs allowed us to assign 
sequences to one of four major haplogroups: 10400T 
defines the major M haplogroup, 10873C defines the 
major L haplogroup, 12308G defines the major U hap- 
logroup (including the K haplogroup), and 12705C de- 


fines the major R haplogroup, which includes a large 
set of haplogroups (H, V, J, T, U, B and F). 

Phylogenetic Analyses 

DNA sequences were aligned using the CLUSTAL V 
program. Sequences were classified into haplogroups ac- 
cording to the nomenclature of Richards et al. (2000) 
and Salas et al. (2002). Genetic diversity measures were 
calculated (from position 16024 to 16383) with the Ar- 
lequin package 2.0 (Excoffier et al. 1992) and com- 
pared to those in a set of North African populations 
(Table 1). Analyses of molecular variance (AM OVA) 
were also performed with Arlequin 2.0. Genetic dis- 
tances between populations using the HVS-I were cal- 
culated by intermatch-mismatch pairwise differences 
according to the equation D — dy — (d,, + d }j )/2 (Nei, 
1987), where dy is the mean pairwise differences be- 
tween populations i and j, and d„ and d }] are the 
mean pairwise differences within populations i and j 
respectively. The distance standard errors were com- 
puted by resampling nucleotide positions with 1,000 
bootstrap iterations (Efron, 1982). The distance ma- 
trix was also employed to build a multidimensional 
scaling using the STATISTICA 6.0 package. Prin- 
cipal component and correspondence analyses were 
obtained from haplotype frequencies using the SPSS 
package. 

The network relating HVSI sequences within some 
of the haplogroups described was constructed by us- 
ing a reduced-median algorithm (Bandelt et al. 1995) 
as implemented in the Network 3.0 program. The dat- 
ing method employed (Saillard et al. 2000) is based on 
the average number of mutations accumulated from an 
ancestral sequence as a linear function of time and mu- 
tation rate. This method was also performed with the 
Network 3.0 program. 

Results 

Haplogroup assignation was performed comparing 
HVS-I sequences with the data set of Richards et al. 
(2000) and Salas et al. (2002), and their classification 
was used. All Chenini-Douiret sequences were classi- 
fied unambiguously using the information yielded by 
the HVS-I variable positions. One sequence from Sened 
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Populations 

Code 

Ref. 

n 

H (SE) 


it 

Matmata Berbers 

TBM 

1 

49 

0.964 

± 

0.021 

5.05 

Sened Berbers 

TB S 

1 

53 

0.975 

± 

0.011 

7.53 

Chenini-Douiret Berbers 

TB CD 

1 

53 

0.939 

± 

0.017 

6.82 

Tunisian Arabs 

Tun 

2 

47 

0.990 

± 

0.009 

6.15 

Mozabites 

Moz 

3 

85 

0.942 

± 

0.010 

4.73 

Algerian Arabs 

Alg 

2 

47 

0.956 

± 

0.014 

5.72 

Center Moroccan Berber 

CB1 

2, 4 

64 

0.968 

± 

0.013 

4.51 

Center Moroccan Berber 

CB2 

8 

60 

0.984 

± 

0.009 

6.00 

South Moroccan Berbers 

SB 

5 

50 

0.961 

± 

0.018 

4.60 

Moroccan Arabs 

MA 

2, 4 

50 

0.993 

± 

0.006 

7.04 

Mauritanians 

Mau 

4 

30 

0.975 

± 

0.017 

6.09 

Egyptians 

Eg 

6 

68 

0.992 

± 

0.005 

7.06 

Saharawi 

Sah 

2, 4 

81 

0.982 

± 

0.006 

5.44 

Tuareg 

Tg 

7 

26 

0.985 

± 

0.014 

7.10 


H: sequence diversity 

71 : average number of pairwise differences 

References: 1: Present study; 2: Plaza et al. (2003); 3: Corte-Real et al. (1996), 4 : Rando 
et al. (1998), 5: Brakez et al. (2001), 6 : Krings et al. (1999), 7: Watson et al. (1996), 8 : 
Thomas et al. (2002) 


Table 1 Genetic diversity parameters in 
North African samples 


and nine from Matmata that could not be unambigu- 
ously assigned into haplogroups based only on the data 
from the control region were typed for positions 10400, 
10873, 12308, and 12705 in the coding region. From 
these, the sequence from Matmata with the following 
mutational pattern 16232-16293-16356 could not be 
typed for these four positions due to lack of DNA, and 
was classified as “other.” Haplogroup frequencies are 
reported in Table 2. 

Sequence Gene Pool in Tunisian Berbers 

The Tunisian Berber mtDNA gene pool is constituted 
by sequences belonging to several major haplogroups: 
the sub-Saharan African L group, the east African Ml 
haplogroup, and the West Eurasian major haplogroups 
N and R. 

Sub-Saharan sequences, represented by haplogroups 
LI, L2, and L3, are found in the three Tunisian Berber 
populations at high frequencies: 26.6% in the Berbers 
from Sened, 24.3% in the Berbers from Matmata, and 
13.3% in the Berbers from Chenini-Douiret, show- 
ing a large sub-Saharan gene flow among these three 
Berber populations. Most of the sub-Saharan sequences 
found were not described in the data sets of Salas 
et al. (2002) , although most of them differ from an al- 
ready described sequence by one or two mutation steps. 
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Noteworthy is the presence of eight sequences, out of 
the 17 L3 sequences found in Tunisian Berbers, that 
harbour a transition at nucleotide position 16041. This 
transition has been found in other Berber and Arab sam- 
ples from North Africa, and its presence is negligible in 
other populations (two South-Western Europeans, three 
West Africans and one East African), suggesting a North 
African origin for this group of sequences, although a 
sub-Saharan origin cannot be rejected. The network of 
sequences bearing the 16041 transition (Figure 2) shows 
a clear star-like phylogeny and a recent origin for this 
group, at around 10500 years (SE 3500 years). 

The Ml haplogroup, to which an East- African origin 
is attributed (Quintana-Murci et al. 1999), is represented 
by a single sequence in six Sened Berbers (10.9%), in 
one individual from Matmata, and it is absent in Berbers 
from Chenini-Douiret. This sequence was found among 
Arabs from Tunisia but not in Algerians (Plaza et al. 

2003). This sequence was also found in other popula- 
tions (Quintana-Murci et al. 1999; Richards et al. 2002) 
with a transversion at position 16183, which may be 
the result of the hypervariable length polymorphism 
in the poly-C tract of the control region (Bendall & 

Sykes, 1995). The presence of a single Ml sequence at 
high frequency in Tunisian Berbers could be explained 
by gene flow from East Africa, followed by genetic 
drift. 
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Table 2 MtDNA haplogroup frequencies in Tunisia 


Haplogroups 

TB CD 
N = 53 

TB S 
N = 53 

TB M 
N = 49 

Tunisian Arabs 3 
N = 47 

Lib* 

_ 

_ 

2.0 

_ 

Llbl 

- 

5.7 

- 

- 

Llcl 

3.8 

- 

- 

- 

L2* 

- 

1.9 

- 

2.1 

L2a 

1.9 

1.9 

2.0 

4.3 

L2al 

- 

3.8 

- 

4.3 

L2ala 

1.9 

3.8 

2.0 

- 

L2bl 

- 

- 

- 

2.1 

L3* 

- 

1.9 

16.3 

2.1 

L3b 

3.8 

3.8 

- 

- 

L3bl 

- 

1.9 

- 

- 

L3d 

- 

- 

2.0 

2.1 

L3el 

- 

- 

- 

2.1 

L3e2 

- 

1.9 

- 

4.3 

L3f 

1.9 

- 

- 

4.3 

Ml 

- 

11.3 

2.0 

4.3 

Nib 

1.9 

- 

8.2 

- 

12 

- 

7.5 

- 

- 

W 

- 

- 

- 

2.1 

X 

- 

- 

- 

2.1 

pre-HV 

- 

5.7 

- 

- 

HV 

15.1 

- 

- 

6.4 

H* 

13.2 

24.5 

26.5 

23.4 

V 

- 

- 

16.3 

- 

J* 

3.8 

3.8 

2.0 

- 

J1 

- 

- 

2.0 

2.1 

J2 

- 

- 

- 

2.1 

T* 

1.9 

1.9 

- 

- 

T1 

30.2 

1.9 

4.1 

2.1 

T2 

- 

- 

- 

2.1 

T3 

- 

- 

- 

2.1 

K* 

5.7 

- 

4.1 

4.3 

K2 

9.4 

- 

- 

2.1 

U* 

- 

1.9 

- 

- 

Ula 

- 

3.8 

- 

4.3 

U3 

5.7 

- 

2.0 

- 

U5* 

- 

1.9 

- 

- 

U5ala 

- 

1.9 

- 

- 

U5b 

- 

- 

- 

2.1 

U6a* 

- 

1.9 

2.0 

4.3 

U6al 

- 

5.7 

- 

- 

U7a 

- 

- 

4.1 

- 

Other 

- 

- 

2.0 

6.4 


“Data from Plaza et al. 2003 


The U6 haplogroup originated in N Africa ~ 40,000 
years ago (Macaulay et al. 1999) and is found in Mo- 
roccan Berbers at 6-8%, reaching 28% in the isolated 
Mozabites from Algeria. In Tunisian Berbers it is found 
at 7.6% in Sened, 2.0% in Matmata and was absent in 
Chenini-Douiret; it has been found at 4.2% in Tunisian 
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Arabs (Plaza et al. 2003). Thus, U6 frequencies in 
Tunisian Berbers are relatively low and may mark an 
eastward decline in the frequencies of this haplogroup. 
All U6 lineages found in Tunisian Berbers belong to 
the U6a (characterised by 16278T), and U6al (charac- 
terised by 16189C and by 16278T) subgroups. 

Haplogroup U/K, excluding U6 sequences, was 
found in 9.5% of the Sened sequences, 10.2% in the 
Matmata and 20.8% in Chenini-Douiret. Haplogroup 
U is represented by Ula, U3, U5, U5ala, U7a, and K 
sequences. The high frequency of this haplogroup in 
the Chenini-Douiret sample is due to the presence of 
eight sequences belonging to haplogroup K. Three of 
them belong to the K root type, and the other five to 
the haplogroup K2, which is represented by two hap- 
lotypes. Haplogroup U7a reached a frequency of 4.1% 
in the Berbers from Matmata. This haplogroup is found 
in Middle Eastern populations such as Iraqis, Palestini- 
ans, Armenians, Druze and Kurds; and some South- 
ern and Eastern Europeans (Richards et al. 2000), but 
none of the already described sequences matched with 
those found in the Matmata Berbers. U7a sequences 
may have been integrated into Matmata Berbers after 
a wave of migration from the Middle East, as they dif- 
fer by only one step mutation from the U7a Middle 
Eastern sequences, whereas three mutation steps sep- 
arate them from the European U7a sequences. Hap- 
logroup U3 is mainly found in populations from the 
Middle East (Richards et al. 2000), and was also re- 
ported in Chenini-Douiret Berbers at 5.7%, and in 
Matmata Berbers at 2.0%. But no match was found 
between Tunisian Berbers and Middle Eastern U3 se- 
quences, nor between U3 sequences among Tunisian 
Berbers. 

H is the most frequent haplogroup in most West 
Eurasian (Richards et al. 2000) and North African (Plaza 
et al. 2003) populations, as well as in Tunisian Berbers, 
where H sequences (and HV sequences, since some- 
times they cannot be directly distinguished neither by 
HVS-I nor by the four coding positions typed in the 
present study, and the typing of position 7025 would be 
recommended) are found at a high frequency: 28.3% 
in the Berbers from Chenini-Douiret, 24.5% in the 
Berbers from Sened, and 26.5% in the Berbers from 
Matmata. Haplogroup V, which is largely distributed in 
Western Mediterranean populations (6% in NW Africa, 
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Figure 2 Phylogenetic network of a section ofhaplogroup L3 bearing a transition in 
position 16041. The size of the circles is proportional to the number of sequences. 
Mutated sites (minus 16000) are indicated along the lines. Transversions are indicated by 
the nucleotide after the number. The sequence marked with an asterisk differs from the 


Cambridge Reference Sequence by transitions at positions 16041 
5% in Iberian Peninsula, and 3,2% in Italy; Plaza et al. 
2003), is found only in the Matmatia with a high fre- 
quency of 16.3%. Haplogroup V is absent in the two 
other Tunisian Berber populations, as is also the case for 
Tunisian and Algerian Arabs. This value is comparable 
with that found in the Saharawi (17.9%) and higher than 
those found in the Basques (10.4%) and other Western 
European samples (Plaza et al. 2003). 8.2% of the V se- 
quences of the Matmata Berbers bear only the variant 
16298C, and the rest of the sequences bear one addi- 
tional variant and did not match with previously de- 
scribed V sequences (Richards et al. 2000). It is also 
relevant to note the high frequency ofhaplogroup HV1 
(15.1%) in Berbers from Chenini-Douiret, represented 
by a unique sequence. 

Haplogroup J is found at very low frequencies in the 
three samples, and except for one Matmata sequence 
that can be classified into thejl group, the rest of se- 
quences remain in the J root type. All the T sequences 
found in the present sample set, except one individual 
from Sened and one from Chenini-Douiret who bear a 
T root type, belong to the T1 subgroup. The distribu- 


d 16223. 

tion ofhaplogroup T in the three Tunisian Berber sam- 
ples is very different. The frequency of T sequences is 
very low in Matmata (4.1%) and Sened Berbers (3.8%), 
whereas haplogroup T is represented in the Chenini- 
Douiret sample by 17 individuals (32.1%) carrying six 
different haplotypes. Of those, the most frequent was 
found in nine individuals and contains a deletion at po- 
sition 16193, which was also observed in a single one- 
step derivative. This sequence type, without the dele- 
tion, was shared with Moroccan Berbers, Mozabites and 
Egyptians, and is also frequent in Europe and the Middle 
East. 

Other haplogroups that are mainly found in the Mid- 
dle East are also present in Tunisian Berbers: 8.2% of 
Matmata Berbers and one individual from Chenini- 
Douiret belong to Nib haplogroup, and 7.5% of Sened 
Berbers displayed 12 sequences. 

Population Structure 

An analysis of the molecular variance (AMOVA) was 
performed with the sample data set described in 
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Table 3 Analyses of Molecular Variance 
(AMOVA) in North African samples 


Groups 

Among 

groups 

Among populations 
within groups 

Within 

populations 

All populations 


4.14“ 

95.86“ 

Arabs 


1.54“ 

98.46” 

Berbers 


6.24“ 

93.76“ 

Moroccan Berbers 


2.47“ 

97.53“ 

Tunisian Berbers 


6.67“ 

93.33“ 

Arabs vs Berbers 

-0.21 ns 

4.25“ 

95.95“ 

Arabs vs Berbers 3 
Moroccan Berbers 

0.04 ns 

3.10“ 

96.86“ 

vs Algerian (Moz) 
vs Tunisian Berbers 
Moroccan Berbers 

0.60 ns 

4.98 

94.42“ 

vs Tunisian Berbers 

-0.16 ns 

4.79“ 

95.37“ 

Morocco vs Algeria-Tunisia 

-0.34 ns 

4.36“ 

95.98“ 


“(p < 0.01); *(p < 0.05); ns : not significant 
“Mozabites and Tuareg excluded from the Berber group 


Table 1. When all populations were considered as a 
single group, 4.14% of the variance was attributed 
to differences among populations, and 95.86% of 
the genetic variance was found within populations 
(Table 3). These populations were more diverse than 
sub-Saharan and West Mediterranean populations, 
where 98.8% (Salas et al. 2002) and 97.4% (Plaza et al. 
2003) of the variance was found within the respec- 
tive population sets. However, this could be due to 
the sampling of one or a few outlier populations that 
are not diluted in a high number of populations, such 
those used in the sub-Saharan and West Mediterranean 
analyses. 

Several population groups based on cultural and ge- 
ographic criteria were performed in order to detect 
some population structure. Populations were first clus- 
tered according to linguistic and cultural criteria (Arab 
vs. Berber), and AMOVA was performed within each 
group. Berber populations were more diverse from each 
other than Arabs were: 6.24% of the genetic vari- 
ance was due to differences among Berber samples 
whereas 1.54% was attributed to differences among 
North African Arabs. When these two groups were 
compared, genetic variation between Arabs and Berbers 
was non-significantly different from 0, and 4.25% of 
the genetic variance was due to differences within 
Arabs and Berbers. When the analysis was also per- 
formed removing Mozabites and Tuaregs, known iso- 
late Berbers where drift has acted strongly, the vari- 


ation among groups remains non-significant, whereas 
the variation within Arabic and Berber groups de- 
creases to 3.10% (p < 0.01). This result shows that, 
even when known outliers are removed, a certain de- 
gree of heterogeneity remains within groups, whereas 
no genetic differences are revealed between Arabs and 
Berbers. 

We next classified Berber populations according to 
their geographic origin (Morocco, Algeria, and Tunisia), 
and no significant differences were found among groups. 
Nonetheless, the variance among populations from the 
same country was significantly high (4.98%, p < 0.01), 
reflecting heterogeneity among the Berber populations 
from the same geographical area. Since Mozabites are 
known outliers that could contribute disproportionately 
to interpopulation variance, they were removed from 
the analysis (Table 3). The apportionment of the vari- 
ance due to differences within groups dropped to 4.79%, 
but was still highly significant (p < 0.01). 

In order to detect a possible east-west differentia- 
tion in Northwest African samples suggested previously 
(Plaza et al. 2003), two geographical groups (Morocco 
versus Algeria-Tunisia) were formed, pooling Arabs and 
Berbers (Table 3). 4.36% (p < 0.01) of the genetic 
variance was attributed to differences within groups, 
whereas no significant differences were found between 
groups. 

In summary, AMOVA showed that Tunisian Berbers 
are highly heterogeneous populations. 
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Tunisian Berbers within the North African 
Genetic Landscape 

Measures of genetic diversity are reported in Table 1 . 
Berbers from Chenini-Douiret present the lowest se- 
quence diversity value of the geographical region con- 
sidered, even lower than the diversity presented by the 
Mozabites, although its confidence interval overlaps 
widely with all others. However, the mean pairwise dif- 
ference in Chenini-Douiret is not lower than in other 
populations. This result points to a micro geographic 
differentiation among the Tunisian Berbers due to ge- 
netic drift, since some of the sequences (coming from a 
variety of haplogroups, hence the relatively high average 
pairwise difference) have high frequencies in the sam- 
ple. When compared with the rest of the North African 
samples, the two other Tunisian Berber populations dis- 
played haplotype diversity values that are in the range of 
the observed values. 

In order to establish the genetic relationship between 
the Tunisian Berbers and the other of North African 
populations, the genetic distance matrix between pop- 
ulations based on individual sequences was represented 
as a bidimensional plot by means of multidimensional 
scaling (MDS; Figure 3). The MDS plot isolates the 


Tuareg, the Mozabites and the Berbers from Chenini- 
Douiret from the rest of the populations, placing each 
of them at one extreme of the plot. The Berbers from 
Matmata are close to the South and Central Moroccan 
Berbers, whereas the Berbers from Sened are adjacent 
to Arab populations. A principal co-ordinate analysis, 
based on the same genetic distance matrix, yielded very 
similar results (data not shown). 

On the basis of the frequency of haplogroups, a cor- 
respondence analysis was performed (Figure 4). The 
Berbers from Chenini-Douiret are associated in the plot 
with haplogroups T, and HV, the Tuareg with L2; and 
Mozabites with U6 and V, each at one extreme of the 
plot, whereas the rest of the populations are situated in 
the centre of the plot. A principal component analy- 
sis produced similar results (not shown). Thus, it seems 
that the heterogeneity detected by AMOVA is mostly 
contributed by the Chenini-Douiret sample. 

Discussion 

The mtDNA haplogroup composition of Tunisian 
Berbers reveals a similar picture to that seen in other 
Northern African populations (Rando et al. 1999; Plaza 



Dimension 1 


Figure 3 Multidimensional scaling (MDS) analysis based on the genetic distance matrix 
of North African samples. Abbreviations as in Table 1. 
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et al. 2003), with ~57% of sequences found in hap- 
logroups of broad West Eurasian distribution, ~ 26% 
of sub-Saharan origin, ~ 14% of recent Middle Eastern 
origin, and ~ 3% locally originated in N Africa (that is, 
U6, although the L3 lineage with a transition at 16041 
should probably be added to this category). 

Few L sequences found in Tunisian Berbers were 
described in sub-Saharan African samples (Salas et al. 
2002), but they differ from one or two positions from 
previously described lineages, providing some evidence 
of sub-Saharan admixture. Nevertheless, the group of 
sequences belonging to the L3 group with the substi- 
tution at position 16041 was also found in other North 
African samples, such as Moroccan Arabs, South Mo- 
roccan Berbers, and Algerians, but no match was found 
with sub-Saharan populations. This result points to a 
recent origin for this group of sequences, estimated 
at around 10500 years, rather than to an ancient sub- 
Saharan gene flow. The presence of these L3 lineages 
and the U6 haplogroup suggests that human popula- 
tions in North Africa have experienced several popu- 
lation expansion processes after the occupation of the 
region by anatomically modern humans, and the extant 
populations are not only the result of external migrations 
from the Middle East, sub-Saharan Africa and Southern 
Europe into the area. 

The high frequency of the East African haplogroup 
Ml in Sened is similar to that in Somalians (11%; 
Watson et al. 1997), Nubians (10%; Krings et al. 1999), 


and Ethiopians (10%; Thomas et al. 2002), nonethe- 
less this haplogroup is represented by a single haplotype. 
The same pattern is shown by other Berber samples, 
with single haplotypes reaching high frequencies, such 
as two sequences belonging to the HV1 and T1 hap- 
logroups in Chenini-Douiret, or two U6al sequences in 
the Mozabites. This result suggests that the introduction 
of these sequences was followed by drift, probably due 
to isolation of Berber populations, which led to differ- 
ent haplogroup frequencies and yielded a high degree 
of heterogeneity among Berber groups. 

The three Tunisian Berber populations analysed for 
the mtDNA control region are characterized by high 
genetic heterogeneity, despite their geographic proxim- 
ity and their common culture and history. Although 
they are separated by ~ 100 Km, they reveal similar hap- 
logroups but at different frequencies. Tunisian Berbers 
are more heterogeneous than Moroccan Berbers, since 
an AMOVA within each group shows an apportion- 
ment of the variance in Moroccan Berbers of 2.47% 
(p < 0.01) compared to 6.67% (p < 0.01) in Tunisian 
Berbers. Nonetheless, when Chenini-Douiret Berbers 
are excluded from the analysis, similar values to those 
of Moroccan Berbers are found in Tunisian samples 
(3.97%, p < 0.01; data not shown). Therefore, the anal- 
yses performed place the Berbers from Chenini-Douiret 
as an outlier sample within Berbers, similar to what is 
observed in Mozabites. Nonetheless, the particularity 
of these Berber samples is not the result of the presence 
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of unusual haplogroups and/ or sequences in the region 
(Plaza et al. 2003). Therefore, the outlier position of 
these Berber groups may be the result of a relatively 
recent isolation process that has caused sequence fre- 
quencies to drift to unusual values when compared to 
the rest of samples within the geographical area. 

From the three Tunisian Berber groups, the Berbers 
from Chenini-Douiret seem to be the most geneti- 
cally isolated with a high West Eurasian component in 
their mtDNA gene pool. This result is in accordance 
with studies based on the analysis of the polymorphic 
Gm system (Chaabani & Cox, 1988; Helal et al. 1988) 
of Tunisian populations, which also have a very de- 
fined phylogeography. Sub-saharan African Gm hap- 
lotypes were present at a low frequency (0.07) among 
the Berbers from Chenini-Douiret, whereas the West 
Eurasian haplotypes were much more frequent (0.62). 
The isolation of this Berber group may explain their 
variant haplogroup frequencies, low sequence diversity, 
and lower sub-Saharan African contribution. 

The cultural differentiation present in North Africa 
between Berber and Arab samples seems not to reflect 
genetic differences between both groups, as shown in 
the AMOVA analyses, and the MDS and PC analyses. 
If Arabs in Northern Africa were mostly descendants of 
Middle Eastern Arabs, the frequencies of haplogroups 
such as N, Ul, U3, U7, and HV that are much more 
prevalent in the Middle East than elsewhere should be 
larger in N. African Arabs than in Berbers. However, 
the opposite is observed: these haplogroups add up to 
5% in N. African Arabs but to 10% in Berbers. Drift 
in some of the more isolated Berber populations could 
explain this observation. The lack of differentiation be- 
tween North African Arabs and Berbers has also been 
observed using other genetic markers such as classical 
markers (Bosch et al. 1997); autosomal STRs (Bosch 
et al. 2000), Alu insertion polymorphisms (Comas et al. 
2000); and Y-chromosome lineages (Bosch et al. 2001). 
This pattern suggests that the Arabization of the area was 
mainly a cultural process, rather than a demographic re- 
placement of the Berber populations that inhabited the 
region where the Arabic expansion took place. 

The present data has failed to confirm an east-west 
differentiation of North African populations as previ- 
ously suggested using mtDNA sequences (Plaza et al. 
2003) or other genetic markers (Bosch et al. 1997). The 
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present mtDNA data show a more patchy genetic land- 
scape in North Africa, with some Berber samples acting 
as outliers in the general North African landscape. The 
lack of mtDNA data for large geographic regions like 
the Kabylie (Algeria) and Libya, and the large number of 
isolated Berber samples considered in the present anal- 
ysis may decrease the power to find the longitudinal 
differentiation previously shown by other studies. 
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